Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔲 ML Hardware
GPU, TPU, inference hardware, AI accelerators, CUDA
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
11298
posts in
12.2
ms
The $500 GPU That
Outperforms
Claude
Sonnet
on Coding Benchmarks
⚡
Performance Engineering
dev.to
·
5d
·
DEV
·
…
Lemonade
by AMD: a fast and open source local LLM server using GPU and
NPU
🧠
LLMs
lemonade-server.ai
·
4h
·
Hacker News
·
…
Improving Efficiency of GPU
Kernel
Optimization Agents using a
Domain-Specific
Language and Speed-of-Light Guidance
⚡
Performance Engineering
arxiv.org
·
1d
·
…
Apple Now Selling
Refurbished
M5
MacBook Pro and iPad 11 at Reduced Prices
🍎
Apple
macrumors.com
·
20h
·
r/apple
·
…
Mold – local AI image generation CLI (FLUX,
SDXL
,
SD1.5
, 8 families)
✍️
Prompt Engineering
utensils.io
·
1d
·
r/StableDiffusion
·
…
soy-tuber/nemotron
: Local multimodal LLM gateway unifying NVIDIA
Nemotron
models on a single GPU
🧠
LLMs
github.com
·
6d
·
r/LocalLLaMA
·
…
Question - will AMD
RDNA
5
AT0
release for gamers
⚡
Performance Engineering
forums.anandtech.com
·
2d
·
…
🎲
Setting
Up a Local LLM
🧠
LLMs
blog.miloslavhomer.cz
·
4d
·
…
Microsoft at
KubeCon
2026 —
DRA
GA, AI Runway, and Kubernetes as AI Infrastructure OS
☁️
Cloud Computing
manoit.co.kr
·
5d
·
DEV
·
…
WebNN
Has a Free API You've Never
Heard
Of
💬
NLP
apify.com
·
5d
·
DEV
·
…
How I
organize
26
microservices
on one GPU without losing my ADHD mind
☁️
Cloud Computing
drakeent.gumroad.com
·
4d
·
DEV
·
…
I
Couldn
’t
Debug
My AI/ML GPU Incident
⚡
Performance Engineering
dev.to
·
11h
·
DEV
·
…
Opinion on
upgrading
GPU's
⚡
Performance Engineering
forums.anandtech.com
·
1d
·
…
ScaleOps
raises $
130M
to improve computing efficiency amid AI demand
🏗️
System Design
techcrunch.com
·
3d
·
…
Robust Batch-Level Query
Routing
for Large Language Models under Cost and Capacity
Constraints
🤖
LLM
arxiv.org
·
2d
·
…
Show HN:
Hollow
–
serverless
web perception for AI agents
🕵️
AI Agents
artiqal.vercel.app
·
5d
·
Hacker News
·
…
Secure Bare Metal AI APIs: Defeat GPU
Hijacking
& Docker
UFW
Bypass
🦀
Rust
servermo.com
·
5d
·
DEV
·
…
You guys seen this? 1-bit model with an
MMLU-R
of 65.7, 8B
params
⚡
Performance Engineering
huggingface.co
·
1d
·
r/LocalLLaMA
·
…
ChatGPT Won't Let You Type Until Cloudflare
Reads
Your React State. I
Decrypted
the Program That Does It.
🦀
Rust
buchodi.com
·
3d
·
Lobsters
,
Hacker News
,
r/browsers
·
…
124x Slower: What
PyTorch
DataLoader
Actually Does at the Kernel Level
⚡
Performance Engineering
dev.to
·
22h
·
DEV
·
…
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help